LARGE SCALE BEHAVIOR OF SEMIFLEXIBLE HETEROPOLYMERS 

FRANCESCO CARAVENNA, GIAMBATTISTA GIACOMIN, AND MASSIMILIANO GUBINELLI 

Abstract. We consider a general discrete model for heterogeneous semiflexible poly- 
mer chains. Both the thermal noise and the inhomogeneous character of the chain (the 
disorder) are modeled in terms of random rotations. We focus on the quenched regime, 
i.e., the analysis is performed for a given realization of the disorder. Semiflexible models 
differ substantially from random walks on short scales, but on large scales a Brownian 
behavior emerges. By exploiting techniques from tensor analysis and non-commutative 
Fourier analysis, we establish the Brownian character of the model on large scales and 
we obtain an expression for the diffusion constant. We moreover give conditions yielding 
quantitative mixing properties. 

CJ 

Resume. On considere un modele discret pour un polymere semi-flexible et heterogene. 
n Le bruit thermique et le caractere heterogene du polymere (le desordre) sont modelises 

en termes de rotations aleatoires. Nous nous concentrons sur le regime de desordre gele, 
c'est-a-dire, l'analyse est effectuee pour une realisation fixee du desordre. Les modeles 
semi-flexibles different sensiblement des marches aleatoires a petite echelle, mais a grande 
i i echelle un comportement brownien apparait. En exploitant des techniques de calcul ten- 

Q^ soriel et d'analyse de Fourier non-commutative, nous etablissons le caractere brownien 

Q, du modele a grande echelle et nous obtenons une expression pour la constante de diffu- 

sion. Nous donnons aussi des conditions qui entrainent des proprietes quantitatives de 
l~i melange. 
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1. Introduction 



1.1. Homogeneous semiflexible polymer models. In the vast polymer modeling lit- 
erature an important role is played by random walks, in fact self-avoiding random walks 
(e.g. [HE]). However they are expected to model properly real polymers only on large 
scales. On shorter scales one observes a stiffer behavior of the chain, and other models 
have been proposed, notably the semiflexible one (see e.g. [9j [16] and references therein). 
A semiflexible polymer is a natural and appealing mathematical object and, in absence 
of self- avoidance, it has been implicitly considered in the probability literature for a long 
time. Consider in fact a probability measure Q on the Lie group SO{d) - the rotations 
in W 1 (d = 2,3,...) - and sample from this, in an independent fashion, a sequence of 

rotations n, r%, Fixing an arbitrary rotation R E SO(d) and denoting by e 1 , . . . , e d 

the unit coordinate vectors in M d , the process {v n } n >Q defined by 

v := Re d , v n := (R ri r 2 ■■■ r n )e d , n=l,2,..., (1.1) 
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is nothing but a random walk on the unit sphere S C K starting at t>o, a much studied 
object (e.g. [UJ 13]). Then the process {^°}n=o,i,... defined by 

n n 

is a homogeneous semiflexible polymer model in dimension d. The reason for writing 



(Rri Ti • • • r n ) instead of (r n r n _i • • • n -R) in (1.1 ) is explained in Remark 1.2 below. 



Remark 1.1. The reader can get some intuition on the process by having a look at the 
two-dimensional case of Figure [TJ This case is in reality particularly easy to analyze in 
detail (and it does not capture the full complexity of the d > 2 case) because the rotations 
in two dimensions commute and they are characterized by only one parameter. More 
precisely, if we identify the random rotation rj with the angle 8j, for j > 1, and we take 
6>o such that vo = (cos Oq, sin Oq), by setting ip n := Oq + Q\ + . . . + 9 n we can write 

(n n \ 

^cos(^),^sin(^) . (1.3) 

3=1 3=1 J 

This explicit expression allows an easy and complete analysis of the two-dimensional case, 
cf. Appendix [A] Of course, in general no such simplification is possible for d > 2. 

Homogeneous semiflexible chains have been used in a variety of contexts |16[ ITT] and 
they do propose challenging questions that are still only partially understood (even in 



their continuum version, see Remark 1.3), also because it is difficult to obtain explicit 
expressions for very basic quantities like the loop formation probability, i.e. the hitting 
probability. As a matter of fact, a more realistic model would have to take into account 
a self-avoiding constraint, which is more properly called excluded volume condition, that 
imposes that the sausage-like trajectory does not self-intersect. This of course makes the 
model extremely difficult to deal with. Added to that, models need to embody the fact that 
often real polymers are inhomogeneous, i.e. they are not made up of identical monomers 
and that this does affect the geometry of the configurations. It is precisely on this latter 
direction that we are going to focus. 

1.2. Heterogeneous models. Heterogeneous semiflexible chains have attracted a sub- 
stantial amount of attention (see e.g. [JJ QUI US EH E7J ) > often (but not only) as a modeling 
frame for DNA or RNA (single or double stranded) chains. The information that we want 
to incorporate in the model is the fact that the monomer units may vary along the chain: 
for the DNA case, the four bases A, T, G and C are the origin of the inhomogeneity and 
couple of monomer units have an associated typical bend that depends on their bases. The 
model we are interested in is therefore still based on randomly sampled rotations n, r2, • • ., 
independent and identically distributed with a given marginal law Q (this represents the 
thermal noise in the chain), but associated to that there is a sequence of rotations uji,uj2, ■ ■ ■ 
that is fixed and does not fluctuate with the chain. If we want to stick to the DNA exam- 
ple, the w-sequence is fixed once the base sequence is given. The model is then defined by 
giving once again the orientation vq = Re d G S^ 1 of the initial monomer and by defining 
for n > 

n 
X v n °' u := VQ + ^tf with vf := (Ru 1 r 1 ---u s r j )e d . (1.4) 

3=1 
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FIGURE 1. A sample bidimensional trajectory, with n = 10 4 and {0i}i=i,2,... 
drawn uniformly from (— ir/10, ir/10), while 6$ is (the notation is the one of 
Remark |1.1[ ) . In the inset there is a zoom of the starting portion of the polymer 
(the starting point is marked by the arrow). It is clear that the starting orientation 
Wo = (1,0) sets up a drift that is forgotten only after a certain number of steps. 
Moreover, even if the starting orientation eventually fades away, in the sense that 
the expectation of the scalar product of v„ and vq vanishes as n becomes large, the 
local orientation is carried along for a while. A precise meaning to this is brought 
by the key concept of persistence length i, that can be defined as the reciprocal 
of the rate of exponential decay of E(v ,v n ), where (•, •) denotes the standard 
scalar product in ~R d and E is the average over the variables {7°^},. Intuitively, one 
expects that on a scale much larger than the persistence length, the semiflexible 
polymer X%° is going to behave like a random walk. Note that if we view the 
elements of SO(d) as linear operators, we can define f := Eri (not a rotation 
unless n is trivial!) and we have ~E(vo,v n ) = (e d ,r n e d ), which shows that the 
decay of E(w , v n ) is indeed of exponential type. 



It should be clear that the rotation uj{ sets up the equilibrium position of the i-th monomer 
with respect to the (i — l)-st. In different terms, the sequence u±,U2, ■ ■ ■ defines the back- 
bone around which the semiflexible chain fluctuates. 

The aim of this paper is to study the large scale behavior of the process {A^ 0,aJ } n when 
the sequence u> is disordered, i.e. it is chosen as the typical realization of a random process. 
The simplest example is of course the one in which the variables uj n are independent 
and identically distributed, but we stress from now that we are interested in the much 



more general case when oj is an ergodic process (see Assumption 1.5 for the definition 
of ergodicity). This includes strongly correlated sequences of random variables and, in 
particular, the ones that have been proposed to mimic the base distributions along the 
DNA (e.g. [15] and references therein). Other aspects of this model deserve attention, 
notably the analysis of the persistence length in the heterogeneous set-up (see the caption 
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of Figure fil and other kind of scaling limits, like the Kratky-Porod limit (see Remark 1.3 ): 
these issues are taken up in a companion paper. 



Remark 1.2. Let us comment on the order of the rotations appearing in equations (1.4) 



and (1.1 ). The key point is the following consideration: in defining the rotations n and uii, 
we assume that the i-th monomer lies along the direction e . Therefore, before applying 
these rotations, we have to express them in the actual reference frame of the i-th monomer. 



Let us be more precise, considering first the homogeneous case given by ( 1.1 ). The rotation 
r\ describes the thermal fluctuations of the first monomer assuming that its equilibrium 
position is e . However the equilibrium position of the first monomer is rather vq = Re , 
therefore we first have to express t\ in the reference frame of t>o, obtaining Rr±R _1 , 
and then apply it to Vo, obtaining v\ = (Rr\ R^ 1 ) vq = (Rr\)e d . The same procedure 
yields V2 = (Rri) r2 (Rri)^ 1 v\ = {Rnrz) e d , and so on. The inhomogeneous case of 



equation (1.4) is analogous: we first apply u)\ expressed in the reference frame of vq, 
getting v' = (Ruj\ i? _1 ) vq = (Ru>i) e d , then we apply r\ expressed in the reference frame 
of v' , obtaining vf = (Ruii) r\(Ruii)~ l v' = {Ru)\ r\) e d , and so on. 

Remark 1.3. Most of the physical literature focuses on a continuum version of the ho- 
mogeneous semiflexible model, often called wormlike chain or Kratky-Porod model (e.g. 
|16j and references therein), which can be obtained in a large scale/high stiffness limit of 
discrete models. As for the discrete semiflexible model we had a discrete length parameter 
n that was in fact counting the monomers along the chain, here we have a continuous pa- 
rameter t > and the location Xt of the wormlike chain at t is equal to L BW(s)ds, where 
|b' '(s)} >n is a Brownian motion on S (e.g. [12]). Note that the initial orientation 

■uo is here replaced by the choice of B^^(O). For d = 2, once again, this process becomes 
particularly easy to describe since b( 2 '(£) = (cos(B(t) + xo),sin(B(t) + yo)), where B is 
a standard Brownian motion. We point out that in the physical literature the continuum 
model is just used for some formal computations and, in the heterogeneous set-up, the 
model is often ill-defined and in fact when simulations are performed usually one goes 
back to a discrete model [H QUI CEJ EH HZ] ■ 

1.3. The Brownian scaling. In order to study the large scale behavior of our model, 
we introduce its diffusive rescaling, i.e. the continuous time process BJ?' (t) defined for 

N e N and tN G N U {0} by 

B^(t) := "/^aT (1-5) 



This definition is extended to every t G [0, oo) by linear interpolation, so that B^'^^-) £ 
C([0, oo)) and it is piecewise affine, where C([0,oo)) denotes the space of real- valued 
continuous functions defined on [0, oo) and is equipped as usual with the topology of 
uniform converge over the compact sets and with the corresponding cr-field. The precise 
hypothesis we make on the thermal noise is as follows. 

Assumption 1.4. The variables ({r n } n >i, P) taking values in SO(d) are independent and 
identically distributed, and the law Q of n satisfies the following irreducibility condition: 
there do not exist linear subspaces V, W C R d such that Q(g G SO{d) : gV = W) = 1, 
except the trivial cases when V = W = {0} or V = W = Mr. 
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We point out that this assumption on Q (actually on its support) is very mild. It is fulfilled 
for instance whenever the support of Q contains a non-empty open set A C SO{d) (this is 
a direct consequence of the fact that an open subset of SO(d) spans SO(d)), in particular 
when Q is absolutely continuous with respect to the Haar measure on SO(d), a very 
reasonable assumption for thermal fluctuations (see S |3.1| for details on the Haar measure) . 
We stress however that absolute continuity is not necessary and in fact several interesting 
cases of discrete laws are allowed (e.g., for d = 3, when Q is supported on the symmetry 



group of a Platonic solid). Also notice that for d = 2 Assumption 1.4 can be restated more 
explicitly as follows: denoting by Rg G SO(2) the rotation by an angle 9, there does not 
exist 9 G [0,7r) such that Q({Rg,R w+ g}) = 1. 

Next we state precisely our assumption on the disorder. 

Assumption 1.5. The sequence ({w„} n >i,P) is stationary, i.e. {to n +i}n>i and {ui n }n>i 
have the same law, and ergodic, i.e. P({w n } n >i G A) G {0, 1} for every shift-invariant 
measurable set A C SO(d) . Shift-invariant means that {x%,X2, • • •} G A if and only if 
{x2, x$, . . .} G A, while measurability is with respect to the product cr-field on SO(d) . 

We can now state our main result. 



Theorem 1.6. If Assumptions 1.4 and |i.5| are satisfied, then ¥ '(do;) -almost surely and 
for every choice of vq the process B^'^ converges in distribution on C([0, 00)) as N — > 00 
toward o~B, where B = {{B\{t), . . . , Bd(t))} t>Q is a standard d-dimensional Brownian 
motion and the positive constant a 2 is given by 

I 2 °° 
a 2 := - + -^EE{eViri ■■■u k r k e d ), (1.6) 

fc=i 
where the series in the right-hand side converges. 

This result says, in particular, that the disorder affects the large scale behavior of the 
polymer only through the diffusion coefficient a 2 . Let us now consider some special cases 
in which a 2 can be made more explicit. Notice first that, by setting r := E(n), we can 
rewrite EE (e d ,uji r\ ■ ■ ■ u>k fk e<i ) = E (e d ,iOi f ■ ■ ■ uj k r e d ). 

• When f = cl, where I denotes the identity matrix and c is a constant (necessarily 
\c\ < 1), the expression for a 2 becomes 

1 9 °° 

a 2 - - + ^c fc E(e rf ,( Wl ..-^)e d >. (1.7) 



d d ^ 

k=\ 



Notice that the non disordered case is recovered by setting uji = I, so that the 
diffusion constant becomes 1/d + 2c/(d(l — c)). Assume now that c > and let us 
switch the disorder on: if we exclude the trivial case when F(iOie d = e d ) = 1, we 
see that the diffusion constant decreases, whatever the disorder law is. 

We point out that by Schur's Lemma the relation r = cl is fulfilled when the law 
of n is conjugation invariant, i.e., P(ri G •) = P(/in h^ 1 G •) for every h G SO(d). 

When the variables LO n are independent (and identically distributed), and with no 
extra-assumption on f, by setting uj := E(u>i) we can write 

'•-s + sI^^-^sK^')- <L8) 

fc=l x ' 
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Notice in fact that Assumption 1.4 yields ||r|| p < 1, where || • || op denotes the 
operator norm (see Section^), hence the geometric series converges. 

In the general case, the expression for the variance is not explicit, but of course it can be 
evaluated numerically. 

In order to get some intuition on the model, in particular on the role of the disorder 
and why it leads to (1.6), we suggest to have a look at Appendix Rl where we work out 



the computation of the asymptotic variance of X%° in the two-dimensional case, where 
elementary tools are available because SO{2) is Abelian. As a matter of fact, these el- 
ementary tools would allow to prove for d = 2 all the results we present in this paper. 
However, the higher dimensional setting is much more subtle and in particular the proof 
of Theorem 1.6 for d > 2 requires more sophisticated techniques: in Section [2] using tensor 



analysis, we prove that Theorem 1.6 follows from Assumption 1.5 plus a general condition 



of exponential convergence of some operator norms, cf. Hypothesis 2.1 below, and we then 



show that this condition is a consequence of Assumption 1.4. 



Remark 1.7. In the homogeneous case, i.e., when disorder is absent, our method yields 



a proof of the result in Theorem 1.6 under a generalized irreducibility condition that is 
weaker than Assumption |1.4| (see Appendix IB]) . This generalized condition is fulfilled in 
particular whenever the support of Q generates a dense subset in SO(d). We point out 



that this last requirement is exactly the assumption under which Theorem 1.6 (in the 
homogeneous case) was proven in [7J [T3] • 



1.4. On strong decay of correlations. The persistence length (cf. caption of Figure [I]) 
does characterize the loss of the initial direction, but from a probabilistic standpoint this is 
not completely satisfactory, since other information could be carried on much further along 
the chain. For this reason, we study the mixing properties of the variables vf (see ( |1.4[ )) 
and this leads to a novel correlation length, that guaranties decorrelation of arbitrary local 
observables. As we will see, we have only a bound on this new correlation length and we 
can establish such a result only for a resticted (but sensible) class of models. 

In order to state the result, let us introduce the cr-field T^ nn := o~(vf : m < i < n) for 
m E N, n E N U {oo} and for fixed oo. Then the mixing index a w (n) of the sequence {vf}i 
is defined for n E N by 

a"(n) :=sup{|P(AnS)-P(A)P(B)|: A E F?,m, B E ^ +n>00 , m E N}. (1.9) 

We work under either one of the following two hypotheses: 

H-l. The law Q of n is conjugation invariant, i.e., P(n E •) = P(/tri h^ 1 E •) for every 
h E SO(d), and for some no the law Q* n ° of (n • • -r no ) has an L 2 density with 
respect to the Haar measure on SO(d) (see j |3.1[ ). 

H-2. The law Q of r\ has an L 2 density with respect to the Haar measure on SO(d). 

Assumption H-fl]is sensibly weaker than H-J2] (of course on the conjugation invariant mea- 
sure), however requiring an L 2 density is quite a reasonable assumptions for thermal 
fluctuations. Then we have 

Proposition 1.8. Under assumptions H-l or H-2 there exist two constants C E (0, oo) 
and h E (0, 1) such that a w {n) < C h n for every n and every u> 
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The proof of Proposition 1.8 relies on Fourier analysis on SO(d): it is given in Sectional 



where one can find also an explicit characterization of the constant h (see ( 3.19|) ) . 



2. The in variance principle 



In this section we prove the invariance principle in Theorem 1.6 including the formula 



(1.6) for the diffusion constant, under some abstract condition, see Hypothesis 2.1 below, 



which is then shown to follow from Assumption 1.4 Throughout the section we set 

tp^n '■= u m r m (Vm+i r m+ i . . . uj n r n , m < n , (2.1) 

so that v - = Rifi a e (see (1.4)). We recall that vq = Re is an arbitrary element of S , 



with R G SO(d), and that Q denotes the law of r\. 

2.1. Tensor products and operator norms. Unless otherwise specified, in this section 
the vector spaces are assumed to be real (i.e., M. is the underlying field) and to have 
finite-dimension. The tensor product of two vector spaces V and W can be introduced for 
example by considering first the Cartesian product V x W and the (infinite-dimensional) 
vector space V x W for which the elements of V x W are a basis. Then the tensor product 
V (g> W is defined as the quotient space of V x W under the equivalence relations 

(v 1 + V2j XW~UiXW + U2XW, V X (wi + IU2) ~ V X W\ + V X VJ2 , 

c(v x w) ~ (cv) x w ~ v x (cw) , 

for c£K, vu\ G V and W(j) G W. The equivalence class of t> x w is denoted by f®w and 
we have the properties {v\ + V2) <8> u; = ui <8> w + v 2 <8> u>, w <8> (wi + W2) = v®wi+v®W2 and 
c(w <giw) = (cv) ®w = v® (cw). Given a basis {ui}i=i,..., n of V and a basis {u>i}i=i,..., m of 
W, {vi (8> Wjli.j is a basis of V <8> W, which is therefore of dimension nm. We stress that 
not every vector in V ® W is of the form v <£> w for some u G V, u; G TV. 

A more concrete construction of y<g>iy is possible in special cases, e.g., when V = W = 
C(M. d ), the vector space of linear operators on M rf (that will be occasionally identified with 
the corresponding representative matrices in the canonical basis). In fact C(M. d ) ®£(M. d ) is 
isomorphic to C(C(R )), the space of all linear operators on £(R ), and this identification 
will be used throughout the paper. Let us be more explicit: given g, h G £(K ), we can 
view g (g) h as the linear operator sending m G £(R ) to 

(g®h)(m) := gmh* , that is [(5 ® /i)(m)]y := ^ gikhjim k i , (2.2) 

fc,Z=l 

where (h*)ij = hji is the adjoint of /i. We are going to use this construction especially 
for g,h G SO(d), which of course is not a vector space, but can be viewed as a subset of 
£(R ). A useful property of this representation of g (g> h as an operator is that 

(gi®hi)(g2®h2) = (5-152) ® (/life) , (2.3) 

which is readily checked from ( |2.2[ ). Another crucial fact is the following one: given s\, S2 G 
C(M. d )*, the bilinear form (5, h) 1— > si(g)s2(h) can be written as a linear form s\ (8) S2 on 
the tensor space >C(M rf ) (8) £(1^), defined on product states 5 (g> /i by 

(si(8)s 2 )(5(8)/t) := si(5)s 2 (/i) (2.4) 

and extended to the whole space by linearity. This linearization procedure is the very 
reason for introducing tensor spaces, as we are going to see below. 
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Let us recall the definition and properties of some operator norms. Given a vector space 
V endowed with a scalar product (•, •) and an operator A £ £(V), we define 

lollop := sup l ^'^ 1 = sup l^i, P|| hs := y/Tr(A*A) , (2.5) 

v,w£V\{o} \\ v \\\\ w \\ Dev\{o} \\ v \\ 

where Tr(A) is the trace of A and A* is the adjoint operator of A, defined by the identity 
(w,Av) = (A*w,v) for all v, w £ V. If we fix an orthonormal basis {e t }i=i,... t n of V and 
we denote by Aij the matrix of A in this basis, we can write ||A||| S := ^2 i ■ |^4ij| 2 . It is 
easily checked that for all operators A, B £ £(V) we have 

H-A-ollop S ||-^||op ll-D Hop j ||-A||op < H-^-Hhs ) H-^-"l|hs S H-^-Hop ||-°||hs • l^-DJ 

In what follows, the space £(R ) is always equipped with the scalar product (i>,u))h s := 
Tr(v*w) = J2i j v ijWij. We can then give some useful bound on the operator norm of g®h 



acting on £(R d ): by (2.2) and (2.6) 



i „,,, \\gvh*\\ hs Hsllopll^llhs ,„ ,„_, 

\9<S>h\\ op = sup — j-j — j-j < sup n — n < Hsllop \\h\\op, (2.7) 

i>e£(IR d )\{0} IMIhs ue£(IR d )\{0} \\ v \\hs 



where we have used that ||/t*|| p — ii"-nop- 

Let us denote by T the orthogonal projection on the subspace of symmetric operators 
in C(R d ), defined for v £ £(R d ) by 

T(v) := l -{v + v*) , i.e. I»^ = 1 ( Vij + Vji ) . (2.8) 

Of course T £ £(£(R )), and for any linear operator m £ C(C(R )) we denote by m its 
symmetrized version: 

m := Tmr. (2.9) 

Note that g®p = (g®g)T = T (g <S> g), for every g £ £(M d ). 

Finally, consider s £ £(M d )* of the form s(g) = (v,gw), where v,w are vectors in R d 
with \\v\\ = || u; || = 1. For every linear operator m £ £(£(R )) we have 

(s(g>s)(m) = (s8s)(rm) = (s®s)(mr) = (s <g> s)(m) , (2.10) 

as one easily checks using coordinates, since (s (S> s)(m) = X^iM v « u j m ij,M w^ wi. It is also 
easily seen that 

|(s®s)(m)| < ||m[|op ■ (2.H) 

These relations are easily generalized to higher order tensor products: in particular 

s ®\ m ) = s ® 4 (m(r®r)) and \s m (m)\ < ||m|| op , (2.12) 

for every m £ C(R d ) m . 

2.2. An abstract condition. We are ready to state a condition on Q that will allow us 
to prove the invariance principle in Theorem |1.6| 

Let us consider Ey^ n , which is an element of £(M d ) (we recall that ip 1 ^ n is defined 
in ( |2.1[ )). We need to assume that, when k is large, E^ n+fc is exponentially close to the 
zero operator on R , uniformly in n. We are also interested in the asymptotic behavior of 
^[Pnn+k ® Pnn+k] ' which by (2.2) is a linear operator on £(M. d ): we need that, when k is 



large and uniformly in n, the symmetrized version E [ ipP , fc <g) ^ n+k ] of this operator, 



cf. (2.9) and (2.8), is exponentially close to the linear operator II defined as the orthogonal 



DISORDERED SEMIFLEXIBLE POLYMERS 



projection on the one dimensional linear subspace of £(R ) spanned by the identity matrix 
{h)i,j = Sij, 1 < i,j < d, that is 



U(v) := -.Tiiv)I d , ve£(R d ). 
a 



(2.13) 



The reason why the operator II should have this form will be clear in £2.5 Let us now 
state more precisely the hypothesis we make on Q. 



Hypothesis 2.1. The law Q ofn is such that, forP-almost every uj, we have 

!oo oo ^ 

EH E [<n +fc ]|| op + £H E Kn+*®<n +fc ] ~ H || < OO . 

fc=0 fc=0 J 



(2.14) 



The next paragraphs are devoted to showing that Theorem 1.6 holds if we assume Hypoth- 
esis 2.1 together with Assumption 1.5 We then show in |2.5| that Hypothesis 2.1 indeed 



follows from Assumption 1.4 



2.3. The diffusion constant. We start identifying the diffusion coefficient a 2 , given by 
equation ( |1.6[ ). For any M rf - valued random variable Z we denote by Cov(Z) its covariance 
matrix: Cov(£/)jj = cov(Z l , Z- 7 ). 



Proposition 2.2. If Hypothesis \2J\ and Assumption 1.5 hold, then for ¥ -almost every u 
and for every vq E S d ~ l we have that 



where 



lim -Covp(A^) = a 2 I d , 

n—>oo n 



ct2 = M 1 + 2 E ee ^<^))' 



(2.15) 
(2.16) 



the series in the right-hand side being convergent. 



Proof. By a standard polarization argument it is enough to prove that for any v E S d 



-l 



lim - varp((v,X^' w )) = a 2 

n— >oo n 



because 



covp (<^X^),^,JC>->) = varp 



e l + e^ 






cj)-a.s. 



varp 



(2.17) 



e — e J 



X V 0r 



V ,LxJ 



(2-18) 

We recall that Xn 0,UJ = vq + J2k=i -^Vi fe e<i ^ wriere we se t v o = Re d for some R E SO{d). 
For notational simplicity, we redefine X^ 0,w := Xn '^ — vq for the rest of the proof (notice 
that this is irrelevant for the purpose of proving (2.17)). Introducing the notation 

s v (g) := {v,Rge d }, for g E C(R d ) , (2.19) 

we have the simple estimate 



\E(v,x v n °n 



E e <a 



\k=l 



< EII E ^i. 

ken 



u\\ < OO, 

K Mop 



(2.20) 
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by Hypothesis 2.1 This shows that, in order to establish (2.17), it is sufficient to consider 

n n—k 

E[(v,X%»») 2 ) = J> [(*,(riU)) s 



+ 2 E E E ^ Wli) *v Ki+k)] ■ (2.21) 



fc=l 



k=l 1=1 



By {[274]) and ( |2.10[ ) we can write (s„(^ fc )) = (s„ (8) s t) )(99 £ J J fc <g> ip^ fe ), and by ( |2.11[ ) 



together with Hypothesis 2.1 we can rewrite the fist sum as 

n / n 



£E[M^ fe )) 2 ] = S f p[^8^] = nsf(U) + O(l). (2.22) 



fe=i 



\fc=i 



In the same spirit the control the off-diagonal terms. We first observe that by (2.3) 

<Pl,l ® <Pl,l+k = <fl,l ® K« ¥?+i,i+fc) = «/ ® V?i,0 (Id ® ¥>M-i,n- fc ) , (2-23) 



where Id G £(R ) is the identity operator. Then by ( |2.4[ ) and ( |2.10 ) we can write 

fl«K,)a«K, +fc ) = s? 2 (r^ jfc ®^, +fc ) = sf((^®^)(/ d ®^ +v+fc )) . (2.24) 



By Q2.11| ), ( p?7| and ( |2l| we then obtain E[s v (tpf {) s v (ipf l+k )] < \\E<pf +ll+k \\ , hence 



by Hypothesis 2.1 it is the clear that 



1 n n—k 

lim limsup- V V E[«„(^,) s v ((f% l+k )] = 0. 

m— >oo n-,00 n 

K=m 1=1 

This allows us to focus on studying the limit asn->oo and for fixed k of 

n— k , n—k 



(2.25) 



i X^M^KKwJ] = I E^( E [^^y](^® E *i,w))' (2-26) 
i=i 1=1 

In this expression we can replace E [ tpf^^ <S> tf^i ] by its limit n by making a negligible error 



(of order 1/n), by Hypothesis 2.1 Furthermore, by the Ergodic Theorem 

1 n—k 

lim -J>® 2 (n(/ d ®E^ +M+it )) = S f (n(J d ®EE^)), P(du,)-a.s.. (2.27) 

) — >nr) r? ^ — * 1)1/ v ! / 



n^oo ri 



/=1 



We have therefore proven that P(do;)-a.s. 



1 



lim - var P {v,X^ w ) 

n— >oo 77, 



S f (H) + 2 J>f (n(I d ®EEtf )fc )) . (2.28) 

fe=i 

Let us simplify this expression: by (2.13 ) the representative matrix of n is Hij^i = 3 $ij $kl 

1 



and by (2.19) we can write s v (g) = '^2 rn (R*v)mg m d, hence 



s f(n) = J2( R * v h( R * v )j u ^dd = \\\ie 



d' 



(2.29) 



because R G SO{d) and u is a unit vector. The second term in the right hand side of (2.28) 
is analogous: setting for simplicity m := ¥,E(pf +l l+k G C(M. ), the matrix of the operator 
Tl(Id ® tri) is given by 

d -, d 



[U(I d <g> m)]ij,ki = ^2 n y>° 6 ( Id ® 



m) a hki 



a,b=l 



- ^2 5ij 5 ab 5 a k mu = -z 5ij m k i , (2.30) 



a,b=l 
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hence 



sf[n(/d«m)] = J2( R * v )i(R*v)j[Wd^m)] ij4d = -m dd . (2.31 



d 



Since rndd = EE(e ,^^e ), we have shown that the right hand side of (2.28) coincides 
with the formula (2.16) for a 2 and therefore equation (2.17) is proven. 



□ 



2.4. The invariance principle. Next we turn to the proof of the full invariance principle. 
The main tool is a projection of the increments of our process {X^ ' w } n on martingale 
increments, to which the Martingale Invariance Principle can be applied. 
We start setting s(g) := Rge d , so that 

%?,J =< = W ~ K"-i , (2-32) 



cf. (1.4) and (2.1). Recalling the definition (2.19) of s v (g), we have s(g) = Yli=i s e i {d) e% ■ 
For n = 1 , 2 . . . we introduce the R -valued process 

Y n := s{ifl n ) - E[s(^, n )] ■ (2-33) 

We now show that, for P-a.e. w, 



supi^||E[F n+fc |j^ 



n>l 



,fc=0 



lL°°(P;l 



< OO 



(2.34) 



where we recall that T^ n := o"(vi, : m < i < n). Observe that E [i^+fc J-J 
KVM,n+k\Fl,n] ~ E Kn+fc]) and we can write 

E[< n+fe ] = VlnV[Pn + l,n+k\Kn] " E Win] E [*> 



E[^fc] 



n+l,n+fcj 



Since ||</?f, 



= K«-E[^, n ])E[^ 1>n+fc ]. 
E K,„]|| op <2, we have 

||E[y n+fc |^ n ]|| LOO(p . Rd) < 2||E[^ +1)n+fc ]|| op , 



(2.35) 



(2.36) 



hence (2.34) follows from Hypothesis 2.1 



We are now ready to prove the invariance principle. It is actually more convenient 
to redefine B^ ,ui (t), which was introduced in (1.5), as "7^-Xwy£p where [a\ £ NU {0} 
denotes the integer part of a. In this way, -B^' w (-) is a process with trajectories in the 
Skorohod space D([0, oo)) of cadlag functions, which is more suitable in order to apply the 
Martingale Invariance Principle. However, since the limit process aB has continuous paths, 
it is elementary to pass from convergence in distribution on D([0, oo)) to convergence on 
C([0, oo)), thus recovering the original statement of Theorem |1.6| 



Theorem 2.3. If Hypothesis 2.1 and Assumption 1.5 hold, then¥(duj)-a.s. and for every 
choice of vq the ~R d -valued process B V ^ )U converges in distribution on C([0,oo)) to aB, 



where B is a standard d- dimensional Brownian motion and a 2 is given by (1.6). 
Proof. Let us set for n > 1 



U n := ^^n+A^n-i] and Z n 



fc=0 



oo 

E 

fc=0 



E[y n+fc |^ n ] -E[F n+fc |^n-i]), (2-37) 



where we agree that Tf^ is the trivial <r-field. Note that U n and Z n are well-defined, 

>;M d ), for 



because by equation (2.34) the series in (2.37) converge in L 



-a.e. uj. The 
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basic observation is that E[Z n |J r f n _ 1 ] = 0, hence Z n is a martingale difference sequence, 
i.e., the process {T n } n >o defined by 

n 

T : = 0, T n := J^Zi, (2.38) 

i=l 

is a {^-"j^^jji— martingale (taking values in R ). Moreover we have by construction 

Y n = E[y n |^ n ] = Z n + (U n -U n+1 ), (2.39) 

that is Y n is just Z n plus a telescopic remainder. Therefore the process {T n } n is very close 
to the original process {Xn 0,UJ } n , because the variables Y n are nothing but the centered 



increments of the process {X^ ' u } n , see (2.32) and (2.33). 

For this reason, we start proving the invariance principle for the rescaled process T N = 
{r Ar (t)} iG [ 0iOO ) defined by T N (t) := ~7^T^ Nt j. By the Martingale Invariance Principle in 

the form given by [HI Corollary 3.24, Ch. VIII], the R -valued process T converges in law 
to aB, where a > and B denotes a standard R -valued Brownian motion, provided the 
following conditions are satisfied: 

(i) the (random) matrix (V n )ij = Y2=\ E [ ( e *> z k)(e j , Z k ) \ T xk ^ x ] , with 1 < i,j < d, 
is such that 

_V n ^-^ a 2 I d in P-probability ; (2.40) 

n 

(ii) the following integr ability condition holds: 

n 

-^E[|Z fe | 2 ;|Z fc |> ev q -2=^0. (2.41) 

k=l 

The second condition is trivial because the variables Z n are bounded, P(du;)-a.s.. The first 
condition requires more work. We first show that varp(- (V n )ij) ->0asn-> oo, for all 
i,j = l,...,d and for P-a.e. u>, and then we prove the convergence of E [~V n "\ . 

We start controlling the variance of V n . By definition (V n )i,j < ^((Vn)i,i + (V n )j,j), hence 
it suffices to show that varp(4 (V n )i t i) — > for every i = 1, . . . ,d. We observet that Z n 
has a nice explicit formula: 

Zn = s( < n _l (<n " E[< n ]) \Jt E &n+l,n+k] ] J , (2-42) 

where we agree that </>n+i n ^ s ^ ne identity operator on M. d (this convention will be used 
throughout the proof). Since s(g) = Yli=i s e i (d) e% i where s v (g) is defined in (2.19), a 
simple computation then yields 

E[(e\Z n ) 2 \^, n -i} = sf{K[Z n ®Z n \^ n _ x ]) = S f(K„-/ 2 e:) , (2.43) 



where we have applied (2.4) and (2.3) and we have set 



9£ := (E[< n ®<J-E[< n ]®E[<J) [j2nVn+l,n+k}®J2 E ^n+i,n+l\) • (2-44) 



\k=0 1=0 



DISORDERED SEMIFLEXIBLE POLYMERS 



13 



Applying (|2.43|) together with (|2.4|) and (|2.3|) we obtain 
1 



varp 



(y n )i,i 



n 



■E 



< 



2 
n 2 



Kfc<n 



E »S 2 («*-if 2 -E[K*-i)® 2 ])e! 



E 



KKKn 



34, E 



(K, fc -i) 02 -E[« fc -i) 02 ]) (2-45) 

® (K/-i)® 2 -e[(^_!)® 2 ])] (e? a efl) . 



Observe that by (2.3) we can write 

W^r - e[(^-iH = K lfe _i) 02 (^-i) 02 - vM,k-i)® 2 ]vM,i-i) 

r K fe -i) 82 -E[K,-i)® 2 ]) E [(^-i) 82 ] 



+ 



(^^{(^-ir-EW-i) 88 ]}, 

(2.46) 



and notice that the term inside the curly brackets is independent of J-f k _ l and vanishes 
when we take the expectation. Therefore we have 



E 



((vVi)® 2 " E[(^,-i) 02 ]) ® («/-i) 02 " E[(^-i) 02 ]) 



E 



(vtk-d^-vM^) 



521 ®2' 



(I^E^^)® 2 ]), 



(2.47) 



where we have applied again (2.3) and where I denotes the identity operator on £(I 



We can therefore rewrite the term in the sum in (|2.45|) as 

( lr W \®<X- T?\( lr V ^(g)2]®2 



CE 



/®E 



« e TlE 



(^-i) 84 -e[k w ; 



[^,w®^i])(er®ejf)) 

(/®E[^ iI _ 1 ®^_ 1 ])(ef®e^)) 



(2.48) 



where we have applied the first relation in (2.12) together with the following relations: 



err = er and E[^ i ,_ 1 ®^ i ,_ 1 ]e^r = E[^. 1 ®^. 1 ]e^ 1 (2.49) 

which follow from the fact t hat ( g ® g)T = g (& g for every g E £(R"). 



We know from Hypothesis 2.1 that when I S> k the operator E </?£;-! ® Pki-x \ ^ s d° se 



to II. Furthermore, if we replace E[c£>rz_i ® f%i-x ] ^y H inside (2.48) we get zero: in fact, 
since trivially <7® 2 n = II for every g 6 SO(d), we have 



E 



(V 



Lfc-ly 



E[K. 



521 ®2 



fc-1/ 



[J®n] 



(2.50) 



= E[K, fc -i) 02 »(« fc -i) 02 n)] - E[« >fc _ 1 ) 82 ]®E[(^ 1 ) 82 n] = o. 

So it remains to take into account the contribution of the error ^\f%i_i ^> f^l-x] ~ ^ 



inside (2.48). However, using Hypothesis 2.1 (|2.7|) and the triangle inequality, we have 
Wx,k-i> ~ X'YvPxfi-x) J 



E 



op 



< 2, lier^eni < 4(i + ch) 4 , (2.51) 
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hence, using the second relation in (2.12), from (2.45) and (2.48) we obtain 
f{Vn)ii\ 8(1 + CH) 4 ^ .. r 

Kk<n, m>0 



11 



n 



11- 



lop 



< 



[1 + C(u;)y 



n 



(2.52) 



having applied Hypothesis 2.1 again. We have therefore shown that varp(- (V n )ij) — » 
asn^oo, for all 1 < i,j < d and for P- almost every oj. 

It remains to prove that E[^ V n ] — ► a 2 1^ as n — > oo and to identify a 2 . Let us first 
note that by d2~32J) and (|2~33J) 



fc=i 



z 



«0,w 



E[X 



V( h W 



-'■ X n ■ 



(2.53) 



We also set Z l n := (e\ Z n ), X l n := (e% X n ) and 0* := (e\ U n ) for short. Since E^n^^] 
and in view of (2.39), we can write 



E[(K) y ] = Y,V[ZlZi] = £e[ZJZ?] = E[(Xi + W n+1 )(Xi + ui +1 )] (2.54) 

fc=l k,l=l 

(note that U\ = 0). We recall that by Proposition |2.2| we have asn-> oo, for P-a.e. u>, 

E[X l n X 3 n ] = (Covp(X^)). . = na 2 ^ + o(n) , (2.55) 

where <r 2 is given by (2.16) (equivalently by ( |1.6[ )). Since sup n ||tAi||z,°°(p ; ] 



< oo by 



(2.34), it follows from (2.54) that as n — ► oo, for P-a.e. u>, we have 



E[(K)y] = V[X l n Xi] + o(n) = nd 2 ^ + o(n) . 



(2.56) 



This completes the proof that the rescaled process T N = {T Ar (i)} ie r 0iOO ) converges in 
distribution as N — > oo to a 2 B, where a 2 is given by (|L6J). 



It finally remains to obtain the same statement for B^ ,w (t) := -7= Xfllf,. Notice that 



by (2.38), (2.39) and (2.53) we can write 



/n [Nty 



sup \\X k {h - T k \\ < sup ||E|_A fc u ' J || + sup \\U k+ i\ 



(2.57) 



where || • || denotes the Euclidean norm in R . However the right hand side is bounded 
in n in L°°(P;IR ), for P-a.e. u> (for the first term see (2.20) while for the second term 
we already know that sup n ||fn||z,°°(p ; R<*) < oo). Therefore sup te r 0M i \\B^ ,UJ (t) — T N (t)\\ < 



3 (p ; : 
(const.)/ yN for every M > 0, and the proof is completed. 



□ 



2.5. Proof of Theorem 1.6. We now show that the abstract condition expressed by 



Hypothesis 2.1 is a consequence of Assumption 1.4. In view of Theorem |2.3| this completes 



the proof of Theorem 1.6 

We start by controlling E [93^ n+fc ] , which is quite easy: the independence of the rj yields 

E [< n+fe ] = un E(n) wn+i E(n) • • • u n+k E(n) . (2.58) 



DISORDERED SEMIFLEXIBLE POLYMERS 15 



It is clear that ||E(ri)|| op < 1. We now show that Assumption 1.4 yields ||E(ri)|| p < 1, 



so that for every to we have 

oo oo oo 

EH E Kn + d|lop < EH E ( r l)1op < Ell E ^)Hop < »■ ( 2 - 59 ) 

k=l k=l k=l 

To prove that ||E(ri)|| op < 1, we argue by contradiction: if ||E(ri)|| op = 1 there would 
exist two vectors x,y G 5' d ~ 1 such that 

1 = (y,E(ri)x) = i (y,gx)Q(dg). (2.60) 

JSO(d) 

Since \{y,gx)\ < 1, for this equality to hold it is necessary that gx = y for Q-almost ev- 
ery g G SO(d). Setting V := {Xx : A G M} and W := {A y : A G R}, this would mean that 



gV = W for Q-almost every g G SO(d), which is in contradiction with Assumption 1.4 

Next we turn to the analysis of E [ ifP . fc © (p^ n+fc ] , which is a linear operator on 

the vector space £(R d ), equipped with the standard scalar product (f,u>)hs = Tr(v*w). 
We decompose C(M. d ) = Hi © H® © ff a as a sum of the orthogonal subspaces consisting 
respectively of the multiples of the identity, of the symmetric matrices with zero trace and 
of the antisymmetric matrices: 

H x := {\I d : AGE}, F s ° := {v G £{R d ) : V* = v and Tr(y) = 0} , 

H a := {v G £(M d ) : v* = -v} . 

All of these subspaces are invariant under g © g, for every g G £(R ), hence they are 
invariant under E [ y?^ , k © ^ n+fc 1 . We recall that II is the orthogonal projection on 



H±, cf. (2.13), while T is the orthogonal projection on Hi © H^, cf. (2.8). Since II and 

lop 



E [ <f%, n +k ® <n+fe ] are zero on #<* and the y coincide on Fi , 1 1 E [ ^ n+fc © ^ n+fc ] - n | 
is nothing but the operator norm of E[^ n+fc © (p^ n+k \ restricted to the subspace H®, 
therefore with obvious notation we can write for every u 



E ll E Kn +fc ® <„+*] " n ILp = E ll E «n +fe ® <n + *]Lo, op • ( 2 -61) 

fc=l fc=l 



However from (2.3) and from the fact that the r% are independent and identically dis- 
tributed we have 

E [<n+fc ® <n+fc] = K © o>„) E[n © rj • • • (w„ +fc © u n+k ) E[n © n] , (2.62) 
hence 

CO oo 

EII E Kn+^<„ +fc ] -n|| op < ECH^^Ikv/- ( 2 - 63 ) 

fe=l fc=l 

We are finally left with showing that ||E[n © ?"i] ||jyo op < 1. Let us assume by contra- 
diction that there exist v,w G H® with \\v\\h s = II^IUs = 1 such that 



1 = (w,E[ri®n}v) hs = / (w,gvg*) hs Q{dg). (2.64) 

JSO(d) 

However H^u <?*||hs = IMIhs = 1) hence (w,gv g*)hs < 1 an d we must have w = gvg* = 
gv g _1 for Q-a.e. g in SO(d). In particular, the matrices v and w are similar and therefore 
they have the same eigenvalues Ai, . . . , Afc, with k < d. Recall that by the spectral theorem 
v and w are diagonalizable. Denoting by K v and K w respectively the eigenspaces of v and 
w corresponding to Ai, we have that 1 < dim(K v ) = dim(K w ) < d — 1, where the last 
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inequality follows from the fact that v and w, having zero-trace and not being identically 
zero, cannot be multiples of the identity. Let us now fix g such that w = gv g~ l and take 
an arbitrary x £ g K v : since j _1 i£ K v we have 

wx = gvg~ x = g{\\g~ x) = Ai x , (2.65) 

which yields x £ K w . Therefore gK v C K w and since the two subspaces have the same 
dimension we must have g K v = K w , for Q-almost every g. This being in contradiction with 



Assumption 1.4 we have indeed that ||E[ri <g> ri] ||//o op < 1 and the proof of Theorem 1.6 
is completed. U 

3. Decay of correlation 

3.1. General notations. We denote by A the normalized Haar measure on SO(d). We 
recall that A is the only probability measure that is left- and right-invariant, i.e., such that 
X(Ag) = X(gA) = \(A) for all g £ SO(d) and (measurable) A C SO(d). In the special 
case d = 3, A describes a (random) rotation around the vector w of angle 9, where w is 
uniform on S 2 and 8 is uniform on [0, 2tt). For more on the Haar measure we refer to [3]. 

We recall that Q denotes the law of n. For fixed u>, we denote by L^ n the law of ^ n 
under P, so that for any bounded and measurable function F : SO{d) — ► R 

E[^«J] = / F(g)L^ n (dg). (3.1) 

JSO(d) 

We also set 

E"(k) := 2 sup ||L^ +1>n+fc - A|| T y , (3.2) 

n 

where the total variation (TV) distance between the probability measures \x and v is 
defined as ||// — z^||tv := sup^ \n{A) — v(A)\. We observe that ||/x — i/||tv coincides with 
2 supigi^ J gd/u — f gdv, in particular if /i is absolutely continuous with respect to v, with 
/ := dfi/du, we have ||/x — ^||tv = ^ / 1/ — 1| dv. 

3.2. Reminders of harmonic analysis on compact groups. Throughout this section, 
we assume that G is a compact topological group, equipped with the Borel cr-field, and A 
is the normalized Haar measure on G (of course we have in mind the specific case where 
G = SO(d), d > 2). We start recalling some basic facts about harmonic analysis on G, 
taking inspiration from [5l [6] . 

Given a (complex) Hilbert space H , a representation of G on H is a group homomor- 
phism U : G -»• B(iJ), i.e., C/( 5 /i) = U{g)U{h) for all g,h £ G, where B(iJ) denotes the 
set of bounded linear operators from i/ to itself. The representation U is said to be: 

• continuous if the map g \— ► (x, U(g)y) from G to C is continuous, for all x,y £ H; 

• irreducible if there is no closed subspace M oi H such that U{g)M C M for every 
g £ G, except the trivial case when M = {0} or M = H; 

• unitary if £/(#) is a unitary operator for every g £ G, i.e., (U(g)x, U(g)y) = (x, y) 
for all x,y £ H, where (•, •) denotes the scalar product in H (that we take skew- 
linear in the first argument and linear in the second). 

Finally, two representations U, U' of G on the Hilbert spaces H, H' are said to be equivalent 
if there exists a linear isometry T : H — > H' such that U(g) = T~ 1 U'(g)T for every g £ G. 
The set of equivalence classes of continuous, irreducible, unitary representations of G is 
denoted by S, which is a countable set (sometimes called the duai object of G). 
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We point out that, since G is compact, all irreducible representations are finite dimen- 
sional, that is, they act on a finite dimensional Hilbert space. Given a G E, we denote by U a 
an arbitrary representation in the class a, acting on the Hilbert space H a of finite dimen- 
sion d a G N. In each space H a we fix an (arbitrary) orthonormal basis {Co* = lj • • ■ j d a } 
and we denote by ufAg) = ((f,U a (g)(?) the matrix of U a (g) on this basis. Notice that 
ufA-) is a continuous function from G to C We have the following orthogonality relations, 
valid for all a, (3 G E, 1 < j, fc < d a; 1 < /, m < d^: 

<fe(fi , )«f m (fi')A(d5) = -^SapSjidkm, (3.3) 

G "a 

where x denotes the complex conjugate of x and 5ij is the Kronecker delta. Therefore 
{^/d^ufA■)} ae Y,,l<ij<d a is an orthonormal set in L 2 (G,dA). A crucial result is that it is 
also complete, i.e., the functions ufA-) span L 2 (G,dA), by the the Peter- Weil Theorem. 

Next we introduce the Fourier transform fi of a probability measure \i on G, which is 
the element of the space S := T\ a £Y;13(H a ) defined by 

£(a) := f U a (g) fi(dg) , aGE. (3.4) 

Jg 

More explicitly, ft(a) is the linear operator acting on H a whose matrix in the basis {Cf }i 

is given by p,(a)ij = J G ufj(g) M d s), for a G S and 1 < i, j < d 



It follows directly from the definition (3.4) and (2.5) that ||/t(a)|| p < 1 for every 



probability measure /ionG and for every a G E. As a matter of fact, when G is connected, 
this inequality is strict for a large class of //, as we show in the following lemma (where we 
denote by a = the trivial representation, with Hq = C and U°(g) = 1 for every g G G). 

Lemma 3.1. Let (j, be a probability measure on G with support V . Assume that V~ X V := 
{h~ 1 g : h,g G V} generates a dense set in G, i.e., the set \J^ =1 (V ~ 1 V) n is dense in G. 
Then ||/t(a)|| op < 1 for every a £ E, a / 0. 

Proof. Suppose that ||/t(a)|| op = 1. Then there must exist x,y G H a with ||x|| = ||y|| = 1 
such that (y,£i(a) x) = 1. Now 

1 = M(y, A(«) x) = j X(y, U a (g) x)u(dg) (3.5) 

The function r{g) = 3?(y, U a {g) x) is real and such that r{g) < 1, and so must be constant 
on the support of /j, and equal to 1. This implies that U a (g) x = y for any g £ V, hence 

U a {h- l g)x = U a {h~ l )U a {g)x = U a {h~ l )y = x (3.6) 

for all g, h G V. This means that the relation U a (g)x = x holds for all g G V" 1 ]^ and hence 
for all g G \J'^ ) = i(V~ 1 V) n . By assumption the latter set is dense in G and the continuity 
of the representation U a yields that U a {g)x = x for all g G G, which is impossible unless 
a is the trivial representation. □ 

We conclude this paragraph noting that the Fourier transform provides an easy tool to 
check whether a probability measure \i has an L? density with respect to the Haar measure 
A. More precisely, we have the following 

Lemma 3.2 (Fourier inversion theorem). A probability measure [i on G is such that 

Y,d a \\Ma)\\l < oo (3.7) 
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if and only if it is absolutely continuous with respect to A with density in L 2 (G,d\). In 
this case, the density f = d^/dA is given by 



fig) = Y,d a Tr(il(a)U a (gy 






Yl d *K®)i,jufj(g), 



(3.8) 



where the series converges in L 2 (G,d\). 



Proof. Since {yfa\ u",(-)}aes, i<i,j<d a is a complete orthonormal set in L 2 (G,dA), the 



same is true if we replace ufA-) by uf-(-), therefore condition (3.7) guarantees that the 
right hand side of (3.8) does define a function / £ L 2 (G, dA). Consider then the (a priori 



complex) measure dz/ := /dA. Using (3.3) it is easy to check that 



H<*)ij : = / u i,j(9) "(dg) = fi(a)ij 



(3.9) 



G 



for all a £ S and 1 < i, j < d a . By Theorem (27.42) of [6: this implies that jjl = v. Vice 
versa, if a function / is in L 2 (G, dA), the right hand side of (3.8 ) is nothing but its Fourier 



series in the orthonormal set {v / 3^ , uf 7 -(-)}ae£, i<i,j<d a , hence relation (3.7) holds true. 



Finally, the second equality in (3.8) is easily checked. 



□ 



3.3. Exponential decay of the total variation norm. In this subsection we need to 
assume that G is also connected (which is of course the case for G = SO(d)). We show 
that, assuming hypothesis H-[I]or hypothesis H-|2](cf. § 1.4), for P-a.e. u>, we have 



X>"(fc) < oo, 



(3.10) 



fceN 



where we recall that E^ik) has been introduced in (3.2 ). As a matter of fact, we are going 



to prove the much stronger result that there exist positive constants c\ , C2 such that 

sup E u {k) < ci e~ C2k , for all k € N. (3.11) 



It is convenient to introduce the convolution /x * v of two probability laws /i,i/onG by 

(n*u)(A) := [ »(Ah- l )v(dh) = [ u(g- 1 A) l x(dg), (3.12) 

Jg Jg 

so that if X, Y are two independent random elements of G with marginal laws //, u, then 
fi * v is the law of XY . Therefore we can express L^ n as 



L 



m,n 



Q*S t 



i^m + l 



Q*...*8u n *Q, 



(3.13) 



where 5 g denotes the Dirac mass at g £ G. We stress that in general the convolution is 
not commutative. A basic property is that \x * v{a) = fi{ot) v(a) for every a G E, or more 
explicitly // * v{a)i^ 



Sfc=i fi( a )i,kv(a)kj, as one easily checks from (3.4). 



In the next crucial lemma we give an explicit bound on E u {k) in terms of the Fourier 
transform Q of Q. We recall that we denote by a = the trivial representation. 



Lemma 3.3. The following relation holds true for every k G N: 

< Yl cUQ(a)||Ll|Q(«)|| 

aeS,a^0 



sup E"{k) 



2(fe-l) 
op 



(3.14) 
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Proof. From (3.13) we can write 



L 



n+l,n+fc 

and using the inequalities in ( |2.6| ) we get 

\\L 



(a) = U a (u n+1 )Q(a) ••• U a (u; n+k ) Q(a) , 



(3.15) 



„ + i,„ +fc («)||L < l|0(a)llLHO(a)fi*- 1) nil l[7tt ( w ^)ll?P ^ IIQHUhsllQHIl^-^ 



(3.16) 

where we used that ||^" a (c^n-H)||o p = 1 because the representation is unitary. Now as- 
sume t hat t he right hand side of (3.14) is finite (otherwise there is nothing to prove). By 
Lemma 3.2 L^ , 1 n+k has a density / e L 2 (G, dA) with respect to A, therefore by Jensen's 
inequality we can write 

4IIL! 



J n+l,n+fc" 



XII 2 



l/-l|dA < 



G 



fdX 



a 



Yl d « n3 



n+l,n+fe 



(«)ll 



2 

hs i 



aSS,a^0 



(3.17) 
where in the last equality we have used Parseval's identity, observing that (/, u".-) = 

(■^n+l n+fc( a ))i • an d that trivially j£t(0) = 1 for every probability measure \x on G. Recall- 
ing the definition (3.2) of E UJ {k), relation (3.14) is proven. □ 

Proof of ( |3.1ip under Hypothesis H^ Let us set / := dQ/dX £ L 2 (G,dX). By Parseval's 
identity we have 



f fdX = X> a ||Q(a)||L < oo. 

'' G aeE 



(3.18) 



In particular, for every e > 0, ||Q(a)[|hs < £ for every a ^ T, with V a finite subset of E. 
Since ||Q(a)|| p < ||Q(a)||hsi we have that [|Q(a)|| p < e for every a G E, a I\ Next 
observe that Lemma |3.1| can be applied, because by hypothesis the support of Q contains 
a non-empty open set A, hence A -1 A is open too and therefore it generates the whole G 
(it is easily seen that, for any non-empty open subset B, U^Li B n is non-empty and both 
open and closed, hence it must be the whole G, which is connected). This observation 
yields 

h := sup HQHHop < 1. (3.19) 

aSS,Q^0 
Therefore from Lemma 13.31 we have that 



sup E u (k) < 



■ z^- 1 ) 



(3.20) 



which proves (3.11) under hypothesis H-J2J □ 

Proof of (3.11) under Hypothesis HH\ Since the law Q is assumed to be conjugation 
invariant, we have f G f(g) Q(dg) = f G f(t~ l gt) Q(dg), for every t £ G. Then for any law 
v on G and for any bounded measurable function / : G -» M we have 



fd{Q*v) 



G 



f(gh) Q{dg) v{dh) 



G JG 



G JG 



f{hg) Q(dg) u{dh) 



[ fd(v*Q), 

JG 

(3.21) 

hence Q*v = v*Q. In particular, taking v = 8 g , the operator Q(a) commutes with U a (g), 

for every g £ G. Schur lemma then yields that Q{a) is a multiple of the identity I a on 
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H a : Q(a) = c a I a for c a G C. Then from (J3.15 ) it follows that 



L 



n+l,n+fe 



(a) = U a (uj n+ i---uj n+k )Q(a) k 



(3.22) 



hence H^+ln+fcWIIhs = IIQ( a ) fc llhs- Since by assumption for k > no the measure Q* k 
has a density /& := dQ* fc /dA G L 2 (G,dA), it follows that also L^ +ln+k (a) has a density 
ffnfc = d^ + i n+ fc/dA G L 2 (G,dA) (cf. Lemma 3.2) and by Parseval's identity we have 



G 



(91 



n.kJ 



dX 



Kfeiil = ii Ai = E^IWIL < °°- 



«eE 



Arguing as above and recalling that Q(a) = c a I a , it follows that (3.19) still holds. We 
therefore have for k > uq 

% 2 
4 ll L n+l,n+fc( a ) - A IItV < 



G' 



< fe - l|dA < / « fc )^dA 



G 



£ d a \\Q(a)% s < £ d 4Q^) n ^ s \\Q( a )C~ n0) = WUoWl-h 2 ^. 



«eE,o^o 



a£S,a^0 



Then sup^ E UJ (k) < ||/n ||2 h k n ° and the proof of equation (3.11 ) is complete. 



(3.23) 

□ 



Proof of Proposition \l.8\ It suffices to prove that for every n and every uj we have a w (ri) < 
2E u (n). Since {(pi t n}n is a (inhomogeneous) Markov process we directly see that 



a"(n) < sup|E[n(^, m )^« m+ J]-E[w(^ m )]E^« m+ n)]! 

u,w 

where u and w vary in the set of measurable maps from G to [0, 1]. Since 



(3.24) 



+ 



«(s) ( Iw(gg') (L^ +lm+n (dg') - X(dg'))\ L? >m (dg) 

w(gg') {L^ +1>m+n (dg') - X(dg'))) L<£ >m (dg) 



G \JG 

u{g)Ll m {dg) 

'G \JG 

the desired bound follows since both \u(-)\ and \w(-)\ are bounded by 1. 



(3.25) 

□ 



Appendix A. The elementary approach to the two-dimensional case 



We give here a partial proof of Theorem 1.6 in the 2-dimensional case. We identify in 
particular the variance <r 2 , cf. (1.6), of the limit process. We set T := 



IttTj) and we 
denote by R a the rotation by an angle a. With reference to (1.4), we write ujj = Rj. and 



r; 



Rg., with 7j and 9j random variables taking values in T. The Fourier coefficients of 
the law Q of 6\ are j^e imx Q{dx) =: q m , for m £ Z. Recall that we are assuming that 
Q({0o, $o + 7r}) < 1 for every 9q and this is equivalent to \q n \ < 1 for n = 1 and n = 2. 

We set 6„ := 6\ + . . . + 6 n and r n := 71 + . . . + 7„ for n £ N, along with $ n :=T n + @ n . 
Therefore the real and complex part of the random variable Zfj := e 1 + . . ,+e N coincide 
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with the components of the random vector XJ?' , vq = (1,0). Our goal is to compute the 
asymptotic covariance matrix of X v ^ ,u) as N — » oo. Note that no centering is needed, since 

N N 

E[Z%] = Y, eiTm E [e i0m ] = 5^"^™' (A.l) 

m=l m=l 

and therefore |E[Z/y]| < |<Zi|/(l — |<Zi|) < °o, because |<7i| < 1. 

We can therefore focus on the second moments. For simplicity, we fix an arbitrary 
direction e^° in M. 2 ~ C, with ft £ T, and we look at the projection {Zn ' } n of the 
process {Z%} n in this direction, i.e. 

Z^° := , Z^° := cos($ 1 - ft) + . • - + cos($„ - ft) . (A.2) 

For n G N and m G N U {0} one directly computes with x = m + T m+n — ft 

E[cos($ n+m -ft)|9 m ] = E[cos(x+e n )] = $l($e ix ) = |gi| n cos(e m +0rt+r m+n -ft) , 

(A.3) 
where 6 is such that e = qi/\qi\- We observe also that for a, b £ T 

E [cos(e m + a) cos(9 m + b)] = ± cos(a -&) + -« (<gV ( ° +6) ) , (A.4) 

and from (A.3) and (A3) we directly see that 

E [cos($ m - ft) cos($„ +m - ft)] = 

i|«i| n {cos(r m+n - T m + 5n) + H ^ e *(r m+ n+r»+^-2f )^ j , (A . 5 ) 

and the latter expression actually holds also for n = 0. We are now ready to estimate 
E[(Z|y ,w ) 2 ]. The expression contains diagonal terms and for those we have 

N N 

]TE[cos 2 (<i> m -ft)] = - +o(N), (A.6) 

771=1 

by (A. 5) with n = (recall that |^| < !)• The off-diagonal terms instead give 

;\r i at ™ at 1 at ~ 



N-lN-m JV-1 JV-n 

2^ ^ E[cos($ m -ft)cos($„ +m -ft)] = 53 q? J]cos(r m+n -r m + en) + o(AT) . 

m=l n=l n=l m=l 

For every fixed n £ N, by the Ergodic Theorem we have that P(dw)-a.s. as N — > oo 

JV-ra 

J3 cos(r m+n - T m + On) = E(cos(r„ + On)) ■ N + o(A) , (A.8) 

m=l 



m=l 

and therefore that P-a.s. 

JV-1 N-n 



JV-1 JV-n / oo \ 

53 (ff 53 cos(r m+n - T m + 0n) = I 53 <?? E (cos(r n + On)) \N + o(N) , (A.9) 

n=l m=l \ n=l / 

so that finally we have P(dw)— a.s. 

JV oo 

-E[(Z^ ) 2 ] = - 53 E(cos(^-ft)cos(^-ft)) ^ - + 53|g 1 rE[cos(r„,+0n)] , 

i,j=l n=l 
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which matches with (1.6). Note that the diffusion coefficient is independent of the direction 



£o and that it depends on the law of 9\ just through the first Fourier coefficient q\. 



Appendix B. The homogeneous case 



The aim of this appendix is to argue that, if disorder is absent, Theorem 1.6 holds under 
the assumption that the support of Q generates a dense set in SO(d). 

In order to do this, let us first observe that, when disorder is absent, we can weaken 
Assumption |1.4| to the following generalized condition: there exist m £ N such that 



(E(ri))' 



lop 



< 1 



and 



(E(ri®ri)) ? 



IflJ.op 



< 1 



(B.l) 



where we recall that H® denotes the space of symmetric real matrices with zero trace. 



We have shown in £2.5 that this condition with m = 1 follows from Assumption 1.4 The 



fact that, when disorder is absent, equation (B.l) is sufficient to yield Hypothesis 2.1 and 



hence Theorem |1.6| is immediately checked: for instance, by (|2.58|) we can write 

oo 

|E[< n+fe ]|L < ^||(E(n)) T 



oo 

E 

fc=i 



I \k/m\ 
lop 



< OO . 



(B.2) 



k=l 



and analogously one shows that J2T=1 ll E [^ in+ fc ® fn,n+k] ~ n ll op < °°' cf- d 2 - 63 D - 



We recall that, for an arbitrary linear operator A on some vector space and for any fixed 
operator norm || • ||, the sequence ||yl m || 1 ' m converges as m — ► oo toward the spectral radius 



of A, denoted Sp(A). Furthermore, by sub-additivity (since || J 4 m+n | 
we have Sp(j4) = inf m£ N [|>l m [| 1 ' m , hence we can restate (B.l ) as 

s P (E(n)) 



op 



< \\A r 



lop 



\A r - 



lop; 



< 1, 



Sp(E(n®ri)|flj) < 1. 



(B.3) 



Let us finally show that equation (B.3) is satisfied whenever the support V of Q gener- 
ates a dense set in SO(d), i.e., whenever the closure of IJfcez V k ls t ne whole SO(d), where 
we set V^ 1 := {g^ 1 : g £ V}, V 2 := {g h : g,h £ V}, and so on. Since this fact is easily 
checked for d = 2, in the following we assume that d > 3. 

We argue by contradiction: if the spectral radius of E(ri) is equal to one, there exists 
v £ C d with ||i>|| = 1 such that E(n) v = e %e v, with £ [0, 2ir), hence 



1 = $t(e l %,E{ ri )v) 



SO{d) 



®(e ie v,gv)Q(dg). 



(B.4) 



In the preceding relations we have denoted by (•, •) the standard Hermitian product on C , 
i.e., (a, b) := J2k=i ®k fyfcj where a denotes the complex conjugate of a. Since ^R.(e l9 v, gv) < 1 
for every g € SO(d), we must have gv = e ie v for every g G V, the support of Q. Writing 
V\ + i : ,V2 with v i , v 2 £ M rf and denoting by U the linear subspace of M d spanned by v\ , V2, 
it follows that gU = U for every g £ V. Since by assumption V generates a dense set 
in SO(d), by continuity we must have gU = U for every g G SO(d), which is clearly 
impossible because 1 < dim(C/) < 2 (recall that we assume d > 3). 

With analogous arguments, if the spectral radius of E(n (8 7*1) on the space H® equals 
one, there must exist v\,V2 £ H® with ||vi||^g + ll^llhs = 1 an< ^ ^ e [0,27r) such that 
g(i>i + iv2)g~ x = e ld (v\ + IV2), for every g £ V. Denoting by U the linear subspace of 
H® spanned by v\,V2, it follows that gU g~ l = U for every g £ V. Since by assumption 
V generates a dense set in SO(d), by continuity we must have gU g~ l = U for every 
g £ SO(d). However this is not possible, because the only linear subspaces W such that 
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gW g^ 1 C W for every g G SO(d) are VF = {0} and VF = H® (i.e., the representation 
SO(d) B g ^-> g ® g on the vector space i^ is irreducible). 

Let us check this fact. We take w G W not identically zero: by the spectral theorem, 
there exists g G SO(d) such that v := gwg -1 £ W is diagonal: Vij = XiSij. Since v 
is not identically zero and it has zero trace, there exist io,jo such that \i ^ Xj . Let 
us now take h G SO(d) to be the matrix that permutes the coordinates io and jo, i.e., 
hij := <5ij for i, j G" {io, jo} while hi j = hji := 5j j and hij = hj i := Su . It is clear that 
v := hvh -1 € W is such that v^ = XiS^, where Aj = Aj for i G" {io>jo} while Aj = A J0 
and A J0 = Aj . Therefore z := (A , ^ A J v — v) G W is such that Zi i = 1, 2j 0J0 = — 1, 

and Zjj = for all the other values of i,j. By considering gx g^ 1 , where g G SO(d) is an 
arbitrary permutation matrix, we obtain all the matrices defined like z but with arbitrary 
io, Jo- These matrices span the linear subspace consisting of all the diagonal matrices with 
zero trace, which are therefore contained in W. However, again by the spectral theorem, 
for any matrix u G H® we can find g G SO(d) such that gug^ 1 is diagonal with zero trace, 
hence we must have W = H® and the proof is completed. 
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